
Correcting the database

We go back to the original data collection forms and find out that participant 6 has a disease duration of 2 years, and is aged 37.

Correct this in your database.

If we really want to check that the data is correct, we can write a formula to calculate disease duration.

Insert a new column: put the cursor on the cell that says Group, then

  • press Insert (from the menu at the top)
  • select Column

In this new column, write =D2-C2

This should calculate "age" minus "age at first symptoms".

Paste this formula down the entire column.

This will give us the disease duration.
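The same check can also be sketched outside the spreadsheet. The Python below is only an illustration under stated assumptions: the field names and the records for participants 5 and 7 are made up, and participant 6's age at first symptoms (35) is inferred from the age (37) and disease duration (2) given above. It recomputes the duration for each row and lists the participants whose recorded value disagrees.

```python
# Hypothetical records for illustration only. Participant 6's age (37) and
# disease duration (2) come from the text; every other value is assumed.
records = [
    {"id": 5, "age_at_first_symptoms": 40, "age": 44, "disease_duration": 4},
    {"id": 6, "age_at_first_symptoms": 35, "age": 37, "disease_duration": 2},
    # Deliberate mismatch: 33 - 29 is 4, but 5 was recorded.
    {"id": 7, "age_at_first_symptoms": 29, "age": 33, "disease_duration": 5},
]


def computed_duration(record):
    """Same calculation as the spreadsheet formula: age minus age at first symptoms."""
    return record["age"] - record["age_at_first_symptoms"]


def find_mismatches(rows):
    """Return the IDs of rows whose recorded duration differs from the computed one."""
    return [r["id"] for r in rows if computed_duration(r) != r["disease_duration"]]


mismatched_ids = find_mismatches(records)
print(mismatched_ids)  # [7]
```

Any ID that is printed would need to be looked up on the original data collection form, just as we did for participant 6.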

1

Is the data in the disease duration column correct now?


It should be all right now.

We have now finished cleaning the data. If this were a real database, we could also check every 10th record, or a random sample of records, against the original forms to make sure the data was entered properly.
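As a rough sketch of how such a spot check might be chosen, the Python below picks every 10th participant and, as an alternative, a random sample. The function names, the 50-participant list and the sample size of 5 are assumptions made for illustration, not part of the exercise.

```python
import random


def every_nth(ids, n=10):
    """Return every n-th ID, starting with the n-th one."""
    return ids[n - 1::n]


def random_sample(ids, k, seed=0):
    """Return k distinct IDs chosen at random (seeded so the choice is repeatable)."""
    rng = random.Random(seed)
    return sorted(rng.sample(ids, k))


# Hypothetical database of 50 participants numbered 1 to 50.
participant_ids = list(range(1, 51))

systematic = every_nth(participant_ids)
print(systematic)  # [10, 20, 30, 40, 50]

spot = random_sample(participant_ids, k=5)
print(spot)  # five distinct IDs to compare against the paper forms
```

Either list gives the records to pull and compare, field by field, with the original data collection forms.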
